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Abstract 

The  formalism  of  probabilistic  languages  has  been  introduced  for  modeling  the  qualitative 
behavior  of  stochastic  discrete  event  systems.  A  probabilistic  language  is  a  unit  interval 
valued  map  over  the  set  of  traces  of  the  system  satisfying  certain  consistency  constraints. 
Regular  language  operators  such  as  choice,  concatenation,  and  Kleene-closure  have  been 
defined  in  the  setting  of  probabilistic  languages  to  allow  modeling  of  complex  systems  in 
terms  of  simpler  ones.  The  set  of  probabilistic  languages  is  closed  under  such  operators  thus 
forming  an  algebra.  It  also  is  a  complete  partial  order  under  a  natural  ordering  in  which 
the  operators  are  continuous.  Hence  recursive  equations  can  be  solved  in  this  algebra.  This 
is  alternatively  derived  by  using  contraction  mapping  theorem  on  the  set  of  probabilistic 
languages  which  is  shown  to  be  a  complete  metric  space.  The  notion  of  regularity,  i.e., 
finiteness  of  automata  representation  of  probabilistic  languages  has  been  defined  and  shown 
that  regularity  is  preserved  under  choice,  concatenation,  and  Kleene-closure.  We  show  that 
this  formalism  is  also  useful  in  describing  system  performances  such  as  completion  time, 
reliability,  etc.  and  present  properties  to  aide  their  computation. 

Keywords:  Discrete  Event  Systems,  Stochastic  Systems,  Automata,  Languages,  Regularity, 
Completion  time,  Reliability 


1  Introduction 


Discrete  event  systems  are  systems  which  involve  quantities  that  are  discrete  and  which 
change  when  events  occur  in  the  system.  Most  prior  work  on  characterizing  qualitative 
behavior  of  such  event  driven  systems  has  been  restricted  to  non-stochastic  systems  where 
the  emphasis  is  on  possibilities  rather  than  on  probabilities.  The  theory  of  time  driven 
stochastic  systems  on  the  other  hand  has  been  well  developed  and  understood.  In  this  paper 
we  introduce  the  formalism  of  probabilistic  languages  to  describe  the  behavior  of  stochastic 
discrete  event  systems.  Such  systems  are  also  represented  as  nondeterministic  automata 
with  probabilities  associated  with  transitions. 

Consider  for  example  a  machine  which  operates  in  one  of  the  three  possible  states — 
idle,  working,  and  broken.  In  the  idle  state,  the  machine  state  changes  to  the  working- 
state  with  probability  one  when  the  “commence”  event  occurs.  In  the  working  state,  the 
machine  state  changes  to  the  broken  state  with  probability,  say,  p  when  the  “breakdown” 
event  occurs,  whereas  it  changes  to  the  idle  state  with  the  remainder  probability  1  —  p  when 
the  “completion”  event  is  executed.  Finally,  in  the  broken  state,  the  machine  state  changes 
to  the  idle  state  with  probability  one  when  the  “repair”  event  is  executed. 

Such  a  stochastic  discrete  event  system  can  clearly  be  represented  by  an  automaton  with 
probabilities  associated  with  transitions  such  that  the  probabilities  of  all  transitions  from 
any  state  add  up  to  at  most  one.  When  the  probabilities  do  not  add  up  to  exactly  one,  this 
means  the  systems  terminates  with  the  remainder  probability.  Thus  by  modeling  systems 
by  “substochastic  automata”  we  model  the  possibility  of  termination,  which  also  allows  us 
to  model  concatenation  or  sequential  operation  of  two  or  more  systems.  The  substochastic 
automata  model  can  be  converted  to  an  equivalent  stochastic  automata  model  by  introducing 
an  auxiliary  “termination”  event  and  assigning  appropriate  probability  to  each  “termination” 
transition  so  that  the  transition  probabilities  from  any  state  add  up  to  exactly  one.  Thus 
this  model  is  similar  to  the  well  known  Markov  chain  models  [2],  The  difference  lies  in  the 
fact  that  unlike  the  Markov  chain  models  the  transition  event  labels  are  emphasized  so  that 
we  can  study  the  event  traces  and  their  occurrence  probabilities.  The  work  on  stochastic 
Petri  nets  [13]  also  follows  the  work  of  Markov  chains  and  is  thus  different  from  the  work 
presented  here. 

Our  model  also  differs  from  Rabin’s  probabilistic  automata  model  [17]  where  the  proba¬ 
bilities  on  transitions  on  each  event  (rather  than  all  events)  from  any  state  add  up  to  one. 
Thus  in  Rabin’s  model  system  changes  its  state  on  each  event  with  probability  one.  How¬ 
ever  in  our  model  the  cumulative  probability  of  state  change  over  all  events  is  at  most  one. 
Rabin’s  model  is  weaker  in  the  sense  that  less  information  about  state  change  on  an  event 
occurrence  is  available.  The  motivation  for  Rabin’s  work  was  to  introduce  the  notion  of 
“cut-languages” .  A  cut-language  is  the  set  of  accepted  event  traces  whose  occurrence  proba¬ 
bility  exceeds  a  given  cut  value.  Our  motivation  here  is  to  study  the  probabilistic  languages 
associated  with  models  bearing  similarity  to  Markov  chain  models.  Thus  a  twofold  difference 
lies  between  Rabin’s  probabilistic  automata  and  the  model  presented  here.  Mortazavian  [15] 
considered  the  supervisory  control  problem  for  systems  modeled  as  Rabin’s  probabilistic  au- 
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tomata.  Thus  our  work  is  also  distinct  from  that  of  Mortzavian.  There  have  been  other 
attempts  to  generalize  deterministic  automata  to  the  probabilistic  setting  [16,  4]  which  also 
follow  the  lines  of  Rabin’s  model. 

As  is  defined  below  a  probabilistic  language  associates  a  probability  (a  value  in  the  unit 
interval)  to  each  finite  length  trace  of  the  system  with  the  following  two  constraints  on  the 
probabilities:  (i)  the  probability  of  the  zero  length  trace  is  one,  and  (ii)  the  probability  of 
any  trace  is  at  the  least  the  cumulative  probability  of  all  its  extensions.  In  [8]  a  probability 
measure  on  the  set  of  traces  is  defined  to  be  a  probability  map  satisfying  the  first  constraint 
above  and  a  stronger  second  constraint  that  requires  that  the  probability  of  each  trace  be 
equal  the  cumulative  probability  of  all  its  extensions.  By  relaxing  this  constraint  we  are 
able  to  model  the  possibility  of  termination,  which  also  allows  us  to  define  concatenation  or 
sequential  operation  of  two  or  more  stochastic  discrete  event  systems. 

A  probabilistic  language  can  be  viewed  a  formal  power  series  [20] — a  map  over  the  set  of 
traces,  but  with  the  additional  two  constraints  described  above.  There  are  many  important 
differences  in  our  work  from  the  classical  work  on  formal  series.  Our  definition  of  concatena¬ 
tion  (product)  and  therefore  of  concatenation-closure  is  quite  different  from  that  of  classical 
definition  of  product.  The  classical  definition  of  product,  called  Cauchy  product,  is  the  same 
as  the  convolution  operator  used  in  this  paper.  Our  definition  of  product  is  more  useful  in 
modeling  concatenation  or  sequential  operation  of  two  or  more  systems. 

Another  early  attempt  to  associate  probabilities  with  traces  was  done  by  researchers  in 
fuzzy  set  theory  [12],  A  fuzzy  language  is  a  fuzzy  set  of  event  traces,  so  that  each  trace  has  a 
membership  grade  in  the  unit  interval  [0, 1].  Our  definition  of  probabilistic  languages  allows 
only  those  membership  grades  which  satisfy  certain  consistency  constraints.  A  consequence 
of  these  constraints  is  that  the  membership  grade  of  a  trace  can  be  viewed  as  the  probability 
that  the  system  executes  that  trace.  This  results  in  a  richer  theory  for  stochastic  discrete 
event  system  modeled  by  probabilistic  languages. 

We  define  regular  language  operators  such  as  choice,  concatenation,  and  Ivleene-closure 
over  the  domain  of  probabilistic  languages  to  allow  modeling  of  complex  systems  in  terms  of 
simpler  ones.  We  show  that  the  set  of  probabilistic  languages  is  closed  under  such  operators, 
thus  forming  an  algebra.  We  show  that  this  set  is  a  well  structured  poset — a  complete  partial 
ordef — under  a  natural  partial  order  in  which  the  operators  are  continuous.  Hence  recursive 
equations  can  be  solved  in  this  algebra.  We  also  establish  this  result  alternatively  by  defining 
a  metric  on  the  set  of  probabilistic  languages  in  which  the  set  is  complete  and  the  operators 
are  continuous  and  contracting.  Consequently,  the  contraction  mapping  theorem  applies 
yielding  existence  of  unique  fixed  points  of  the  operators.  We  characterize  the  regularity  of 
probabilistic  languages,  i.e. ,  finiteness  of  their  automaton  representation  and  show  that  the 
operators  preserve  regularity.  Thus,  probabilistic  languages  forms  a  suitable  formalism  for 
modeling  and  analysis  of  stochastic  discrete  event  systems. 

We  show  that  probabilistic  language  formalism  is  also  useful  in  describing  system  perfor¬ 
mances  such  as  average  completion  time  and  reliability.  We  present  analytical  techniques  to 
aide  in  computation  of  “bilinear”  performance  functions  by  showing  that  they  enjoy  several 
compositional  properties  so  that  performances  of  complex  systems  can  be  easily  obtained 


2 


in  terms  of  those  of  their  simpler  constituent  sub-systems.  Computation  of  performance 
functions  by  recursion  over  regular  operations  was  first  investigated  in  [14].  Our  results 
extend  this  technique  to  recursion  over  probabilistic  regular  operations.  The  formalism  of 
probabilistic  languages  can  also  be  used  for  studying  supervisory  control  problems  [18,  11] 
of  stochastic  discrete  event  systems.  The  concluding  section  discusses  some  ways  in  which 
control  problems  can  be  studied  in  this  formalism.  More  detailed  examples  and  algorithms 
are  subject  of  work  under  progress. 

The  rest  of  the  paper  is  organized  as  follows.  Section  2  describes  the  notation  in  this 
paper.  Section  3  defines  probabilistic  languages  and  introduces  a  partial  order  on  the  class 
of  probabilistic  languages  making  it  a  complete  partial  order.  Section  4  describes  various 
operators  defined  for  probabilistic  languages  and  studies  their  properties.  Section  5  defines 
probabilistic  automata  and  relates  them  to  probabilistic  languages  that  are  regular.  Section  6 
defines  system  performances  and  describes  compositional  methods  to  compute  them.  Section 
7  concludes  the  work  presented  here  and  discusses  some  control  problems  that  can  be  studied 
in  this  formalism.  Appendix  A  gives  a  metric  in  which  the  set  of  probabilistic  languages  is 
complete  and  the  operators  are  continuous  and  contracting. 

2  Notation  and  Preliminaries 

We  use  E  to  denote  the  universe  of  events.  E*  denotes  the  set  of  all  finite  length  sequences 
of  events  from  E,  including  the  zero  length  trace  e.  A  member  of  E*  is  called  a  trace,  and 
a  subset  of  it  is  called  a  language.  Given  traces  s  and  t,  we  use  s  <  t  to  denote  that  s-  is 
a  prefix  of  t.  in  which  case  the  notation  s~1t  is  used  to  denote  the  suffix  of  t.  obtained  by 
removing  the  prefix  s,  i.e. ,  t.  =  ss_1t.  A  language  is  called  prefix  closed  if  it  contains  all 
prefixes  of  all  of  its  traces.  Thus  a  prefix  closed  non-empty  language  always  contains  e. 

A  nondeterministic  automaton  G  over  the  event  set  E  is  a  triple  G  :  =  (A",  x0l  8),  where 
A"  is  the  set  of  states,  x0  G  X  is  the  initial  state,  and  8  :  X  x  E  — >■  2X  is  the  partial 
nondeterministic  transition  function  which  gives  the  set  of  possible  resulting  states  when  a 
certain  event  in  a  given  state  occurs.  For  more  details  on  languages  and  automata  refer  to 

[9]- 

Given  a  set  A",  a  partial  order  on  A",  denoted  A,  is  a  binary  relation  that  is  reflexive, 
anti-symmetric,  and  transitive.  The  pair  (A".  A)  is  called  a  partially  ordered  set  or  a  poset. 
Given  a  poset  (A,  A)  and  a  pair  of  elements  i,t/G  X,  their  infimum  and  supremum  whenever 
defined  are  unique  and  are  denoted  by  x\ly  and  x\Jy,  respectively.  The  poset  (A",  A)  is  called 
an  inf  semi-lattice  (respectively,  a  sup  semi-lattice)  whenever  the  infimum  (respectively, 
supremum)  is  defined  for  any  pair  of  poset  elements.  It  is  called  a  lattice  if  it  is  both  inf  as 
well  as  sup  semi-lattice,  x  G  X  is  called  the  bottom  (respectively,  the  top)  element  if  x  <  y 
(respectively,  y  <  x)  for  each  y  G  X. 

A  chain  in  a  poset  is  a  monotonically  increasing  sequence  of  poset  elements.  I.e.,  {:£*};> o, 
where  Xi  G  A"  for  each  i,  is  called  a  chain  if  Xi  A  Xj  whenever  i  <  j.  A  poset  (A,  A)  is  called 
a  complete  partial  order  (cpo)  if  it  contains  the  bottom  element  of  A",  and  the  supremum 
element  of  each  chain. 
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Given  a  poset  (X,  A),  a  function  f  :  X  X  is  called  monotone  if  the  partial  ordering  is 
preserved  under  its  transformation,  i.e.,  x  <  y  implies  f(x)  A  f(y).  The  function  is  called 
continuous  if  it  commutes  with  the  supremum  operation  taken  over  a  chain,  i.e.,  given  a 
chain  {a’,;},  continuity  of  /  requires  that  /(L^a*)  =  U if(xi).  Note  that  for  continuity  of  /  to 
be  defined,  X  must  be  a  cpo  (so  that  Ua,  is  defined  for  a  chain  {a*}).  It  is  easy  to  show  that 
every  continuous  function  is  also  monotone. 

a  G  X  is  called  a  fixed  point  of  /  :  X  — >■  X  if  /(a)  =  a.  It  is  known  that  every 
continuous  function  defined  over  a  cpo  ( X ,  A)  possesses  a  infimum  fixed  point  which  is  given 
by  U,;>o/'(-L),  where  _L  denotes  the  bottom  element  of  A",  f 1  denotes  the  Afold  application 
of  /,  and  /°  is  the  identity  function.  Thus  recursive  equations  of  the  type  /(a)  =  a  can 
be  solved  whenever  /  is  continuous  and  is  defined  on  a  cpo.  We  use  this  fact  to  define 
recursive  equations  in  the  domain  of  probabilistic  languages.  For  further  reading  on  posets 
and  functions  defined  on  posets  refer  to  [3]. 

Given  a  set  12,  a  a- algebra  on  Q  is  a  set  of  subsets  of  Q  such  that  it  is  closed  under 
complementation  and  countable  union,  and  contains  12.  The  set  of  a- algebras  on  12  is  closed 
under  arbitrary  intersection.  So  given  a  collection  of  subsets  of  12,  there  exists  the  smallest 
(7-algebra  containing  the  given  subsets.  This  is  called  the  er-algebra  generated  by  the  given 
subsets.  A  measurable  space  is  a  pair  (12,  JF),  where  12  is  an  arbitrary  set,  called  a  sample 
space ,  and  T  is  a  a- algebra  on  12,  each  member  of  which  is  called  a  measurable  set.  Given 
a  measurable  space  (12,  T),  a  probability  measure  P  on  it  is  map  P  :  T  — >■  [0, 1]  that  maps 
(i)  12  to  one,  and  (ii)  satisfies  a-additivity,  i.e.,  probability  measure  of  countable  union  of 
pairwise  disjoint  sets  in  T  equals  the  sum  of  their  individual  probability  measures.  The 
triple  (12,  JF,  P)  is  called  a  probability  space.  For  further  detail  on  measurable  spaces  and 
probability  measures  refer  to  [21], 

3  Probabilistic  Languages 

A  language  can  be  viewed  as  a  binary  valued  map  over  the  set  of  traces  in  E*  which 
assigns  a  unit  value  to  a  trace  if  and  only  if  the  trace  belongs  to  the  language.  If  this 
language  represents  the  qualitative  behavior  of  a  certain  discrete  event  system,  then  it  must 
be  non-empty  and  prefix  closed,  i.e.,  the  zero  length  trace  must  be  mapped  to  one,  and 
whenever  a  trace  is  mapped  to  one  any  of  its  prefix  must  also  be  mapped  to  one. 

We  view  the  probabilistic  language  of  a  stochastic  discrete  event  system  in  a  similar 
manner  which  assigns  a  probability  measure  (a  value  in  the  unit  interval)  to  each  trace  in  E* 
with  the  interpretation  that  this  value  determines  its  probability  of  its  occurrence.  Since  a 
zero  length  trace  is  always  possible  in  a  system,  its  probability  measure  must  be  one.  Also, 
for  a  trace  to  occur,  all  its  prefixes  must  occur  first.  So  the  cumulative  probability  of  all 
traces  sharing  a  common  prefix  should  not  exceed  the  probability  of  the  prefix  itself.  This 
captured  by  properties  PI  and  P2  below. 

In  order  to  represent  the  qualitative  behavior  of  a  stochastic  discrete  event  system,  we 
first  describe  the  underlying  measurable  space.  For  mathematical  convenience  we  explicitly 
represent  the  occurrence  of  termination  by  augmenting  the  event  set  with  the  “termination 
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event”,  denoted  oA.  Then  the  sample  space  for  the  behavior  of  a  stochastic  discrete  event 
system  is  given  by  the  set  of  all  finite  length  traces  possibly  followed  by  the  termination  event, 
i.e. ,  Q  =  E*(oA  +  e)  =  EVaUE*.  Since  we  are  interested  in  the  probability  of  “occurrence  of 
a  trace  s  G  Q” — which  occurs  whenever  any  trace  containing  s  as  its  prefix  executes,  we  let 
the  set  of  all  traces  having  s  as  a  prefix  be  a  measurable  set.  We  use  <  s  >:=  {st  \  st  G  0} 
(and  simply  s  when  there  is  no  contextual  confusion)  to  denote  this  measurable  set.  Then 
the  set  of  all  measurable  sets  T  is  the  er-algebra  generated  by  {<  s  >|  s  G  Q}. 

Definition  1  Consider  the  measurable  space  (D,  JF),  where  O  =  E*(EA  +  e)  and  T  is  the 
ex-algebra  generated  by  {<  s  >\  s  G  D}.  Then  a  probabilistic  language  (p-language)  L  is  a 
probability  measure  on  the  measurable  space  (O,^7),  i.e.,  it  is  a  map  L  :  T  — >■  [0, 1]  that 
maps  O  to  one  and  satisfies  the  property  of  cr-additivity. 

Remark  1  Since  (i)  the  probability  measure  L  on  the  a- algebra  T  is  uniquely  determined 
by  its  value  on  the  set  {<  s  >]  s  G  0}  C  T  which  generates  the  a- algebra  T .  and  (ii)  the 
set  {<  s  >|  s  G  0}  has  the  obvious  one  to  one  correspondence  with  the  set  O,  we  can  view 
L  to  be  a  unit  interval  valued  map  on  Q.  With  this  correspondence  in  mind  for  each  s  G  Q, 
we  use  L(s)  to  represent  the  probability  measure  L(<  s  >),  and  refer  to  it  as  the  probability 
of  occurrence  of  trace  s. 

Next,  due  to  the  property  of  a-additivity  of  L,  we  have 

Vs  G  E*  :  L{sa A)  =  L{s)  -  ^  L{sa),  (1) 

ae  s 

where  L(si ja)  represents  the  probability  of  termination  following  .s,  L(s)  represents  the 
probability  of  occurrence  of  s,  and  represents  the  probability  of  continuous 

operation  beyond  s.  Note  that  in  Equation  (1)  s  G  E*,  so  sc ta  G  E*<ta  and  so  G  E*  for  each 
o  G  E.  Thus  the  value  of  L  on  E*<ta  is  uniquely  determined  by  its  vale  on  E*.  Hence  it 
suffices  to  view  L  as  a  unit  interval  valued  map  on  E*  (rather  than  on  its  superset  fi),  which 
we  do  from  here  on. 

For  future  notational  convenience  we  define 

Vs  G  E*  :  A (L)(s)  :=  L{soA);  A (L)(s)  :=  L(sa ) 

<tGS 

to  be  the  probability  of  termination  and  probability  of  continuous  operation,  respectively, 
following  the  execution  of  trace  s.  Then  it  is  clear  that  for  any  L,  A (L)  and  A(L)  are  both  unit 
interval  valued  maps  on  E*.  The  Equation  (1)  can  be  concisely  written  as:  A(L)  =  L  —  A(L). 

Remark  2  Consider  the  measurable  space  (fh  T)  as  given  in  Definition  1,  and  a  p-language 
L  :  T  — >■  [0, 1].  Then  it  is  easy  to  see  that  the  following  two  properties  hold  when  L  is  viewed 
as  a  unit  interval  valued  map  on  E*: 

PI:  L(e)  =  1 
P2:  L  -  A (L)  >  0 
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PI  follows  from  the  fact  that  L(e)  =  L(<  e  >)  =  L(Q)  =  1,  whereas  P2  follows  from  the 
fact  that  for  each  s  G  £*,  L(s)  —  A (L)(s)  =  L(sa^)  >  0. 

Conversely,  given  a  map  L  :  E*  — >■  [0, 1]  satisfying  PI  and  P2  it  can  be  extended  in 
the  following  manner  to  obtain  a  probability  measure  on  (Q,JF):  First  Equation  (1)  is 
used  to  extend  L  from  E*  to  Q.  Next  for  any  s  G  Q,  the  probability  measure  of  the  set 
<  s  >G  T  is  simply  given  by  L(s).  Finally,  the  probability  measure  for  the  collection  of  sets 
{<  s  >|  s  G  fi}  to  obtain  the  probability  measure  of  any  set  in  T  by  applying  the  property 
of  (T-additivity. 

The  above  observation  motivates  the  following  the  simplified  definition  of  a  p-language 
describing  the  qualitative  behavior  of  a  stochastic  discrete  event  system. 

Definition  2  A  p-language  L  is  a  unit  interval  valued  map  on  E*  satisfying  PI  and  P2.  We 
use  C  to  denote  the  set  of  all  p-languages. 

Since  the  qualitative  behavior  of  a  stochastic  discrete  event  system  is  completely  described 
by  its  p-language,  the  terms  “system”  and  “p-language”  are  used  below  synonymously. 

Remark  3  A  consequence  of  P2  is  that  L(s)  >  L(t )  whenever  s  is  a  prefix  of  t.  Also  note 
that  for  any  trace  s,  L(s)  <  1  can  also  be  derived  from  PI  and  P2.  However,  we  prefer  to 
keep  it  as  part  of  the  definition  for  simplicity. 

Example  1  Consider  a  system  that  deadlocks  initially.  We  use  nil  p-language  I  to  represent 
its  behavior.  Then  1(e)  =  1,  and  I(s)  =  0  for  s  ^  e.  It  is  easy  to  verify  that  PI  and  P2  hold 
for  /;  and  A  (I)  =  I. 

Consider  a  Bernoulli  process  where  each  experiment  has  two  outcomes  a  and  b  with 
probabilities  p  and  (1  —  p).  respectively.  Here  the  event  set  E  =  {a,  b},  and  the  associated 
p-language  L  is  given  by: 


Vs  G  E*  :  L(s)  =  p#M(  1  -  p)#^s\ 

where  #(a,  s)  represents  the  number  of  occurrences  of  a  in  the  trace  s.  It  is  clear  that  PI 
and  P2  hold  for  L;  and  A (L)  =  0. 

Note  that  the  outcomes  corresponding  to  terminations  after  distinct  traces  are  mutually 
exclusive.  Hence  the  cumulative  probability  of  termination  of  a  system  can  be  obtained  by 
adding  the  individual  probabilities  over  all  possible  traces:  ^(L)(s).  For  the  system  with 

nil  language  this  cumulative  probability  equals  one  which  implies  that  it  is  a  terminating 
system.  On  the  other  hand,  for  the  Bernoulli  process  this  cumulative  probability  is  zero 
implying  that  this  system  does  not  terminate.  In  the  following  theorem  we  establish  that  this 
cumulative  probability  of  termination  is  bounded  above  by  one  (a  system  is  not  necessarily 
guaranteed  to  terminate),  and  it  equals  one  if  and  only  if  the  probability  of  traces  of  arbitrary 
length  converges  to  zero. 
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Theorem  1  Consider  a  p-language  L.  Then 

1.  E,A(L)(»)<1 

2-  E»  A(L)(s)  =  1  -s=>  limt^E|(|=/h-l(*)  =  o 

Proof:  1.  Define  S(n)  :=  S|s|<n  A(L)(s)  to  be  the  probability  of  termination  in  at  most  n 
steps  of  execution,  so  that  A(L)(s)  =  lim^^  S(n).  Then  since  S(n)  is  monotonically 
increasing,  it  suffices  to  show  that  it  is  bounded  above  by  one.  (This  implies  that  its  limiting- 
value  is  also  bounded  above  by  one.) 

We  first  show  using  induction  on  n  that  S(n )  =  1  —  ^|g|=n+1  L(s).  Clearly  this  holds  for 
n  =  0,  since  in  that  case  S(n)  =  C(e)  and  1  —  J2\s\=iL{s)  =  L(e)  —  J2aL{(r)  =  C(e).  This 
establishes  the  base  step.  For  the  induction  step  we  have 

S(n  + 1)  =  S{n)  +  Y  A(L)(s) 

\s\=n+l 

{from  induction  hypothesis  and  definition  of  A) 

=  [1-  E  £(»)]+  E  [£(«)-  Eu^)] 

|s|=n+l  |s|=n+l  a 

=  1~  Y  EL(-scj) 

|s|=n+l  a 

=  i-  E  LM, 

\s\=n+2 

This  establishes  the  induction  step. 

Now  since  S(n )  =  1  —  Z)|s|=n+r  T(s),  and  by  definition  L(s)  >  0,  it  follows  that  S(n)  <  1 
as  desired. 

2.  We  have  shown  above  that  Z)|s|<n  A(L)(s)  =  S(n)  =  l  —  Z)|s|=n+i  T(s).  So  in  the  limit 

when  n  approaches  inifinity,  this  gives  us  ]TS  A(L)(s)  =  1  —  lini/.;_>oc  X]|s|=yt  L(s),  from  which 
the  result  follows.  ■ 

We  conclude  this  section  by  defining  a  natural  partial  order  on  the  set  of  p-languages 
under  which  it  is  a  cpo.  Continuity  properties  of  various  regular  operators  defined  on  the  set 
of  p-languages  is  then  investigated  with  respect  to  this  order  in  the  next  section.  These  con¬ 
tinuity  properties  are  then  used  to  establish  that  recursive  equations  involving  the  operators 
are  well  defined. 

Definition  3  Given  a  pair  of  p-languages  K ,  L  E  C,  define  K  <  L  if  and  only  if 

Vs  G  E*  :  K{s)  <  L(s). 

It  is  easy  to  see  that  the  relation  N  as  defined  above  is  reflexive,  anti-symmetric  and  tran¬ 
sitive,  i.e. ,  it  is  a  partial  order.  The  infimum  and  supremum  of  K,L  G  C  are  defined  as 
follows: 

K  n  L(s)  =  inf (K(s),L(s));  K  U  L(s)  :=  sup (K(s),L(s)). 
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Example  2  Consider  p-languages  H,I\,L  defined  over  E  =  {a,  b}: 

H(e)  =  1  ,H(a)  =  0.4,  H(cib)  =  0.3 ,H(s)  =  0,  otherwise 

K{e)  =  1  ,K(a)  =  0.4,  K{b)  =  0.5 ,K(ab)  =  0.4,  K(s)  =  0,  otherwise 

L(e)  =  1 ,  L(a)  =  0.6,  L(b)  =  0.4,  L(cib)  =  0.3,  L(s)  =  0,  otherwise 

Then  H  N  K ,  and  H  <  L,  while  it  and  L  are  incomparable.  Note  that  L'nL  is  a  p-language. 
However,  it  UL  is  not  a  p-language,  since  K  U L(a)  =  0.6  and  K\JL(b)  =  0.5  which  violates 
P2. 


It  follows  from  the  above  example  that  the  set  of  p-languages  under  partial  order  N  is 
not  a  lattice.  The  following  theorem  shows  that  this  set  is  an  inf  semi-lattice,  and  also  a 
cpo. 

Theorem  2  Consider  the  partial  order  N  on  C  as  defined  above.  Then 

1.  (£,  N)  is  an  inf  semi-lattice. 

2.  (£,  N)  is  a  cpo. 

Proof:  1.  Consider  K ,  L  G  C.  We  need  to  show  that  K  n  L  e  C.  i.e.,  PI  and  P2  hold  for 
I\  n  L.  Clearly,  K  n  L(e)  =  inf (K(e),  L(e))  =  1,  which  implies  that  PI  holds.  In  order  to 
see  that  P2  also  holds  consider  the  following: 

Y,Kn  L(sa )  =  Y.  inf(A'(gq),  L(sa)) 

a  a 

{from  property  of  addition  and  infimum} 

<  inf  (^A'(s'a),^L(s'Cj)) 

a  a 

{from  P2} 

<  inf  (K(s),L(s))  =KnL(s). 

2.  The  bottom  element  for  £  is  the  nil  language  /  G  £.  So  next  consider  a  chain  of 
p-languages  {£,;},;> o,  i.e.,  L,  -<  Lj  whenever  i  <  j.  Then  for  each  s  G  E*,  {£,;(.§)}  is  a 
monotonically  increasing  secjuence  of  reals  bounded  above  by  1,  and  hence  converges.  We 
use  Loo  to  denote  the  resulting  “trace-wise”  limit  of  the  chain  of  p-languages.  We  need  to 
show  that  Loo  itself  is  a  p-language. 

Clearly,  L00(e)  =  lim^oo  L,;(e)  =  1,  which  implies  that  PI  holds.  In  order  to  see  that  P2 
also  holds  consider  the  following: 

ELooM  =  E  lim  Li{s°) 

l — YOO 

(T  a 

{limit  commutes  with  sum  over  finite  terms} 

=  lim  E  Li  ( sa ) 

l— >-00 

C 7 

{from  P2} 

<  lim  Lj(s)  =  Loo(s). 

i — yoo 
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4  Operators  for  Probabilistic  Languages 

In  this  section  we  define  some  operations  on  p-languages  and  study  their  properties. 
These  operators  are  useful  in  describing  a  complex  system  as  a  composition  of  many  simple 
systems. 

4.1  Choice 

This  operator  captures  non-deterministic  choice  between  two  systems.  The  composite 
system  behaves  as  one  of  the  two  systems  with  certain  given  probabilities. 

Definition  4  Given  a  pair  of  p-languages  Li,L2  G  C ,  and  e  G  [0,1],  the  choice  operation, 
denoted  Lx  +e  L2,  is  defined  as: 


/- 1  T e  f-  1  + 


where  e  :=  1  —  e. 


In  other  words,  the  combined  system  either  behaves  as  L  \  with  probability  e  or  as  L2  with 
probability  1  —  e.  As  a  part  of  the  next  theorem  we  prove  that  L\  +e  L2  is  a  p-language. 
Note  that  the  choice  operator  can  be  easily  generalized  for  multiple  p-languages,  in  which 
case  it  will  be  a  convex  combination  of  its  argument  p-languages. 

In  the  following  theorem  we  study  a  few  properties  of  the  choice  operator. 

Theorem  3  Consider  Li,  L2  G  £  and  e  G  [0, 1]. 

1.  L\  Te  L2  is  a  p-language. 

2.  A  (Li  +e  L2)  =  eA(Li)  +  e(A  (L2). 

3.  Choice  is  a  continuous  operator  in  both  of  its  arguments. 

Proof:  1.  That  L\  +e  L2  is  a  p-language  is  clear  since  it  is  the  convex  combination  of  two 
p-languages. 

2.  This  follows  from  the  following  series  of  equalities: 


A(Li  +e  L2) 


L\  +e  L2  —  A(Li  +e  L2) 

{eLi  +  eL2)  —  A (eL\  I •  eL2 ) 
{from  linearity  of  A} 

(cLi  +  e,L2)  —  [eA(Li)  +  eA(T2)] 
\e.L\  —  eA(Li)]  +  [cL2  —  eA(Z2)] 
eA(Li)  +  eA  (L2). 
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3.  By  symmetry  of  the  definition  of  the  choice  operator  in  its  arguments,  it  suffices  to 
show  that  for  any  chain  {L,}  of  p-languages  and  e  G  [0, 1]: 

R  +e  (!  =  U i(K  +e  Li). 

Note  that  U.,L.,  is  well  defined  from  Theorem  2.  Then  the  desired  equality  follows  from  the 
observation  that  for  any  s  6  S’: 

eK(s)  +  e(  lim  Lj)(s)  =  lim  [eK(s]  +  <  /.,(*)•. 

z — yoo  i—¥  oo 

where  we  have  used  the  fact  that  the  limit  operation  commutes  with  summation  over  a  finite 
number  of  terms.  ■ 

Since  choice  is  continuous  in  both  its  arguments,  it  is  possible  to  define  a  “choice  function” 
mapping  C  to  itself  and  obtain  its  infimum  fixed  point.  Given  L  €  C  and  e  G  [0, 1],  we  define 
the  function  +eL  :  C  — >■  C  as  follows: 

MH  G  C  :  +l{H)  :=L+e  H. 

Next  we  study  the  fixed  point  equation  involving  the  choice  function. 

Theorem  4  Given  L  G  C  and  e  G  [0, 1],  the  infimum  fixed  point  of  the  equation  +eL(H )  = 
H ,  i.e. ,  H  =  L  +eH,  where  H  G  C  is  the  variable  of  the  equation,  is  given  by  L  when  e  ^  0, 
and  /  when  e  =  0. 

Proof:  Since  (£,  R)  is  a  cpo,  and  the  choice  function  +eL  is  continuous  in  this  cpo,  its 
infimum  fixed  point  exists.  Note  that  when  e  =  0,  then  for  any  H  G  £,  +eL(H )  =  eL  +  eH  = 
H ,  i.e.,  each  p-language  is  a  fixed  point.  So  the  infimum  fixed  point  equals  /.  For  the  case 
when  e  ^  0,  we  proceed  as  follows. 

Note  L  +e  L  =  e.L  +  eL  =  L,  implying  that  L  is  a  fixed  point.  We  need  to  show  that  it  is 
the  infimum  fixed  point.  From  a  well  know  result  on  fixed  points  [3,  10],  we  know  that  the 
infimum  fixed  point  can  be  computed  as  Uj>o(+eT)*(/),  where  the  nil  p-language  I  is  the 
bottom  element  of  C.  Define  Hn  :=  U,;<n(+eL)*(/).  Then  it  is  easy  to  show  using  induction 
on  n  that 

n— 1 

Hn  =  J2  e*eL  +  enI. 

i= 0 

Since  (  /  0.  e  /  1 .  which  implies  en  approaches  zero  as  n  approaches  infinity.  Hence  the 
limiting  value  of  Hn  is  given  by 

Y ^  e‘eL  = - zeL  =  -eL  =  L, 

■kt  1  -  e  e 


as  desired. 
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4.2  Concatenation 

This  operator  captures  the  sequential  operation  of  two  systems.  The  composite  system 
behaves  as  one  system,  and  upon  termination  continues  to  behave  as  the  other  system  with 
a  certain  given  probability. 

Example  3  Suppose  the  p-languages  Lx  and  L2  describe  the  processes  of  tossing  a  coin 
until  a  head  and  a  tail,  respectively,  appear.  Suppose  the  probability  of  getting  a  head  in 
the  first  (resp.,  second)  process  is  pi  (resp.,  p2).  Then  the  event  set  for  both  the  systems  is 
given  by  E  =  {toss},  and 

Li(tossn)  =  (1  —  Pi)n;  A(Li)(tossn)  =  (1  -  pi)npi 

L2( toss”)  =  p2 ;  A(L2)(tossn)  =  p2  (1  -  p2). 

Next  suppose  the  first  coin  is  tossed  until  a  head  appears,  and  then  with  probability  e  the 
second  coin  is  selected  and  tossed  until  the  tail  appears.  Then  the  behavior  of  the  composite 
process  is  described  by  the  concatenation  of  the  two  processes,  which  we  denote  by  T1.eL2. 
It  is  clear  that  a  sequence  of  n  tosses  will  take  place  in  the  composite  process  if  either  (i)  the 
first  coin  is  tossed  n  times,  always  getting  a  tail,  or  (ii)  the  first  coin  is  tossed  m  <  n  times, 
always  getting  a  tail,  at  which  point  the  first  process  terminates  (a  head  occurs  next),  and 
then  with  probability  e  the  second  coin  is  selected  and  tossed  n  —  m  times,  always  getting  a 
head.  This  motivates  the  following  definition  of  concatenation. 

Definition  5  Given  a  pair  of  p-languages  L  \ .  L2  G  £,  and  e  G  [0,1],  the  concatenation 
operation,  denoted  L\.eL2,  is  defined  as: 

Vs  G  E*  :  Li.eL2(s)  :=  Lx{a)  +  A(L1)(t)L2(r1s) 

t<s 

=  Li(s)  -  eA(Li)(s)  +  A(L1)(f)L2(f_: 1s). 

t 

In  other  words,  the  composite  system  executes  a  trace  s  by  first  behaving  as  the  system  with 
p-language  Lx,  and  either  executes  the  entire  trace  s  that  way,  or  terminates  after  executing 
a  prefix  t  of  s  and  then  with  probability  e  executes  the  remainder  of  the  trace  tr 1  s  as  the 
system  with  p-language  L2. 

We  show  below  that  L\.eL2  is  indeed  a  p-language,  i.e. ,  it  satisfies  PI  and  P2.  We 
first  prove  a  few  properties  of  the  “convolution”  operation  that  appears  in  form  of  the 
term  ^■(Li){t)L2(t~1s)  in  the  definition  of  concatenation.  The  following  definition  of 
convolution  is  introduced  to  simplify  future  notation. 

Definition  6  Given  a  pair  of  real  valued  /,  g  :  E*  — >■  1Z,  their  convolution ,  denoted  /  o  g,  is 
defined  as: 

/  °9(s)  ■=  £f(t)g(t~ 1s). 

t 
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Then  the  concatenation  operation  can  be  simply  defined  as: 

Lx.eL2  =  L\  —  eA(Li)  +  eA(Lx)  o  L2. 

The  convolution  operator  defined  above  is  termed  as  product  (or  Cauchy  product)  in  the  the¬ 
ory  of  formal  power  series  [20].  We  use  convolution  to  avoid  any  confusion  with  concatenation 
product. 

The  following  lemma  describes  certain  properties  of  the  convolution  operator. 

Lemma  1  Let  /,  g ,  h  :  S*  — >■  71  be  real  valued  functions,  and  e,  e!  G  7 Z. 

1.  [1]  I  is  the  identity  for  convolution,  i.e.,  foI  =  Iof  =  f. 

2.  [1]  Convolution  is  associative,  i.e.,  /  o  (g  o  h)  =  (f  o  g)  o  h. 

3.  Convolution  is  continuous  in  both  arguments,  i.e.,  /o(U,;c/,j)  =  I  A /'■■<);).  and  (Uifi)og  = 
Uiifiog). 

4.  [1]  Convolution  is  a  linear  operator,  i.e.,  /  o  (eg  +  e'h)  =  ef  o  g  +  e'f  o  h. 

5.  If  g(e)  =  1,  then  A (/  o  g)  =  f  o  A (g)  +  A (/). 

Proof:  The  proofs  for  parts  1,  2,  and  4  can  be  found  in  [1],  So  we  only  prove  parts  3  and  5. 
3.  The  following  holds  for  each  s  G  E*: 

/°(Uj0j(s))  =  /(*)(  lim  Qiit^s)) 

t  ^°° 

{limit  commutes  with  sum  over  finite  terms  (all  prefixes  of  s) } 

i — yoc 

=  u  iifogiis)), 

The  continuity  in  the  other  argument  can  be  similarly  proved. 

5.  The  following  holds  for  any  sGS‘: 

A  (f°g){s)  =  J2f°s(S(J) 

a 

=  EE  fWgit-'sa) 

<y  t<sa 

=  Y.  Yf^)9^~1^)  +  f{sa)g(e) 

(T  |_t<S 

{since  g((sa)~l sa)  =  g(e)  =  1} 

t<s  <y  cr 

=  YfWMg)^1*)  +  A(/)(s) 

t<s 

=  f  0  Mff)(s)  +  A(/)(s). 
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In  the  following  theorem  we  study  a  few  properties  of  the  concatenation  operator. 


Theorem  5  Consider  Lx,  L2  G  C  and  e  G  [0, 1]. 

1.  Li.eL2  is  a  p-language. 

2.  A(L1.eL2)  =  eA(Li)  +  eA(Li)  o  A(L2). 

3.  Concatenation  is  continuous  in  its  second  argument;  it  is  not  even  monotone  in  its  first 
argument. 

Proof:  1.  PI  follows  from  the  following: 

Li-eL2(e)  =  Li(e)  —  eA(L\)(e)  +  eA(L\)  o  L2(e)  =  L\(e)  —  eA(L\)(e)  +  eA(L\)(e)L2(e)  =  1. 

In  order  to  prove  P2  it  suffices  to  show  that  A{L\.  eL2)  >  0.  This  follows  from  the  second 
part  below. 

2.  We  have  the  following  series  of  equalit  ies: 

A(L\.eL2 )  =  L\.eL2  —  A(L\.eL2 ) 

=  [Li  -  rA(/.,l  +  eA(Za)  o  L2\  -  A[h  -  eA(L1)  +  eA^h)  o  L2\ 

{using  linearity  of  A  and  rearranging} 

=  [Lx  -  A(Li)]  +  e[A(L1)  o  L2  -  A(Za)  +  A(A(L0)  -  A(A(Lx)  o  L2)} 

{from  Lemma  1  A(A(Li)  o  L2)  =  A(Li)  o  A(L2)  +  A(A(Li))} 

=  A  (Li)  +  e[A(Li)  o  L2  -  A^)  +  A(A  (L,))  -  A  (Lx)  o  A  (L2)  -  A(A(L1))] 
=  A(Li)  +  e[A(L!)  o  L2  -  A(h)  -  A(h)  o  A (L2)] 

{from  linearity  of  convolution} 

=  [A(Li)  -  eA(Li)]  +  eA(Li)  o  [L2  -  A(L2)] 

=  eA^J  +  eA^JoAlLa). 

3.  Consider  K  G  A  and  a  chain  of  p-languages  {L,,}.  Then  we  have  the  following  series 
of  equalities: 

/\„  I  /.,  =  /\  fi-A(I\  )  I  c  A(/\' )  o  I  jLj 

{from  continuity  of  convolution,  Lemma  1,  part  3} 

=  K  —  eA(K)  +  e  U,(  A{  A  )  o  /., 

{limit  commutes  with  sum  over  finite  terms} 

=  UtK  -  eA{K)  +  eA{K)  o  Li 
—  t — 

In  order  to  see  that  concatenation  is  not  even  monotone  in  its  first  argument,  consider 
p-languages  K ,  L  over  the  event  set  E  =  {a,  6}  with 
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K(e)  =  1  ,K(ci)  =  0.2,  K(s)  =  0,  otherwise 
L(e)  =  1  ,L(b)  =  1.0,  L(s)  =  0,  otherwise. 


Then  I  ■<  K,  but  I.\L  and  K.\L  are  incomparable  since  I.\L  =  L  and  K.\L  is  given  by: 

K.iL(e)  =  1  ,K.iL(a)  =  .2 ,K.iL(b)  =  .8,  K.iL(cib)  =  .2 ,K.±L(s)  =  0,  otherwise. 

This  completes  the  proof.  ■ 

Remark  4  In  order  to  understand  why  concatenation  is  not  continuous  in  its  first  argument 
consider  a  chain  of  p-languages  {L,  },  a  p-language  K  E  C.  and  e  E  [0, 1].  Then 

U iU.eK  =  U iU  -  eA(U iU)  +  eA(l  ,/■;)  o  K. 

Thus  for  the  concatenation  to  be  continuous  in  its  first  argument,  we  need  A(i  ,7.,)  = 
UjA(Lj),  i.e. ,  we  need  the  limiting  completion  probability  function  to  be  the  limit  of  the 
individual  completion  probability  functions.  However,  although  the  given  p-language  form  a 
chain,  their  completion  probability  functions  may  not,  and  so  their  limit  may  not  even  exist. 
This  causes  the  loss  of  continuity  of  concatenation  in  its  first  argument. 

Since  concatenation  is  continuous  in  its  second  argument,  it  is  possible  to  define  a  “con¬ 
catenation  function”  mapping  C  to  itself  and  obtain  its  infimum  fixed  point.  Given  L  E  C 
and  e  E  [0, 1],  we  define  the  function  .eL  :£—>■£  as  follows: 

MH  E  C  :  .eL(H)  :=  L.eH. 

Next  we  study  the  fixed  point  equation  involving  the  concatenation  function. 

Theorem  6  Given  L  E  C  and  e  G  [0, 1],  the  infimum  fixed  point  of  the  equation  . eL(H )  = 
H ,  i.e.,  H  =  L.eH ,  is  given  by  ]Ci>o  e*A(L)W  o  L  —  J2i>i  e*A (L)^\  where  A (L)W  represents 
the  /-fold  convolution  of  A (L)  with  itself,  and  A(L)^0^  :=  I. 

Proof:  Since  (A,  is  a  cpo,  and  the  concatenation  function  .eL  is  continuous  on  this  cpo, 
its  infimum  fixed  point  exists.  From  a  well  know  fixed  point  result  [3,  10],  the  infimum 
fixed  point  is  given  by  Uj>0(.eL)*(/),  where  /  is  the  nil  p-language  which  is  also  the  bottom 
element  of  C. 

Define  Hn  :=  U, ;<„(. eL)l(I).  We  show  using  induction  on  n  that 

n— 1  n— 1 

Hn  =  £  e*A (£)«  o  L-Y,  e?A (Lf\  (2) 

i  n  i  i 

from  which  the  result  follows  by  taking  the  limit  as  n  approaches  infinity.  For  the  base  step 
consider  n  =  1.  Then  by  definition 

H}  =  L.eI  =  L  -  eA (L)  +  eA(L)  o  I  =  L  -  eA (L)  +  eA(L)  =  L, 
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where  the  last  equality  follows  from  the  fact  that  I  is  the  identity  of  convolution  (Lemma  1). 
Also,  when  n  =  1,  (2)  reduces  to: 

Hx  =  e°A(L)(0)  o  L  =  I  o  L  =  L, 

which  establishes  the  base  step.  In  order  to  see  the  induction  step  we  have: 


H, 


n+ 1 


•  e  Hn 


L  -  eA (L)  +  e.A(L)  o  Hn 
{from  induction  hypothesis) 


n— 1 


n— 1 


=  L  —  eA(L)  +  eA(L)  o  [£  eiA(L)^  ol-^  eiA(L)w] 

2=0  2=1 
n  n 

=  L-  eA  (L)  +  ^  e*'A(L)(i)  o  L  -  £  #A(L)W 
2=1  2=2 

n  n 

=  [L  +  Ee,A(i)''’  o  L]  -  [eA(L)  +  £ 

2=1  2=2 

n  n 

=  Y,  eiA  {L)(i]  O  L-Y  <:A(I.)'"r. 

i= 0  j=l 

This  establishes  the  induction  step  and  completes  the  proof. 


Remark  5  Since  the  choice  and  concatenation  functions  as  defined  above  are  both  continu¬ 
ous,  any  function  formed  using  them  is  also  continuous  and  thus  possesses  the  infimum  fixed 
point.  As  an  example  consider  the  function  /  :  C  — »  C  defined  as  /(if)  =  L  +i  ( K.iH )  for 
each  if  G  £,  where  K ,  L  E  C  are  fixed  p-languages.  Then  its  infimum  fixed  point  can  be 
computed  using  the  technique  illustrated  above. 


4.3  Concatenation  Closure 

The  infimum  fixed  point  of  the  concatenation  function  allows  us  to  define  the  concate¬ 
nation  closure  operator.  This  describes  a  system  which  upon  termination  continues  with  a 
certain  probability  e  to  behave  as  itself  any  finite  number  of  times.  This  operation  resembles 
the  Kleene  closure  operation  from  the  theory  of  formal  languages  and  captures  the  notion  of 
repeated  sequential  behavior. 

Definition  7  Given  a  p-language  L  G  C  and  e  G  [0, 1],  the  concatenation  closure  operation, 
denoted  L*%  is  defined  as: 

L*e  :=  YeiA {L)(i)  o  L  -Ye*A (L)W  =  L  +  5>?;A(L)(?)  o  [L  -  f], 

i>0  i> 1  i> 1 

i.e. ,  it  is  the  infimum  fixed  point  of  the  concatenation  function  ,eL 


15 


Since  L*e  as  defined  above  is  the  infimum  fixed  point  of  the  concatenation  function 
(Theorem  6),  it  follows  that  it  is  a  p-language,  i.e. ,  satisfies  PI  and  P2.  The  (n  +  l)th  term 
of  the  summation  in  the  definition  of  concatenation  closure  is  enA (L)^  o  [L  —  /],  which 
represents  that  the  system  repeatedly  behaves  as  itself  n  different  times — terminating  each 
time  (represented  by  the  n-fold  convolution  term  e"A(L)^n^) — and  then  continues  as  itself 
for  the  n  +  1th  time  without  terminating  and  executing  a  “non-epsilon  trace”  (represented 
by  the  final  convolution  term  L  —  I). 

In  the  following  theorem  we  obtain  a  property  of  the  concatenation  closure  operator. 

Theorem  7  Consider  a  p-language  L  e  £  and  e  G  [0, 1].  Then  A(L*e)  =  ~e  Ei>i  e''A (L)^. 

Proof:  For  notational  simplicity  define  (eA(L))^  :=  J^i>1etA(L)^.  Then  L*e  =  L  + 
(rA(/,) )’ ' 1  c  (I.  /).  So 

A  (L*e) 

=  L  +  (eA(L))(+)  o  (L  —  I)  —  A {L  +  (eA(T))(+)  o  (L  -  /)) 

{from  linearity  of  A} 

=  L  +  (eA (L))(+)  o(L-I)-  A (L)  -  A((eA (L))(+)  o  L)  +  A((eA(L))(+)) 

{from  Lemma  1,  parts  1  and  5} 

=  L  +  [(eA(L))(+)  of  -  (eA(L))(+)]  -  A  (L)  -  [(eA(L))(+)  o  A  (L)  +  A((eA(L))(+))] 
+A((eA(L))«+>) 

=  [L-  A(L)]  +  [(eA(L))<+>  o  (L  -  A(L))]  -  (eA(£))<+> 

=  A (L)  +  (eA(L))(+)  O  A (L)  -  (eA(L))M 

=  -\e.A(L)  +  ey  e<A(L)W  o  A(L)1  -  (eA(L))(+) 
e 

=  -[eA (L)  +  ye’A(f)«]  -  (eA(L))(+) 
e  FA 

=  lEe‘A(L)«]-(eA(L))«+» 

e  i>  1 

=  [i  -  l](eA(L))<+> 

=  heA(L))<+>, 

as  desired.  ■ 

Example  4  Consider  a  process  of  tossing  a  coin  which  is  repeated  till  a  head  occurs.  Sup¬ 
pose  that  the  probability  of  getting  a  tail  is  e.  Then  in  this  example,  the  “toss  once”  process 
is  repeated  each  time  a  tail  occurs,  i.e.,  with  probability  e.  So  the  “repeated  toss”  process 
can  be  represented  as  the  concatenation-closure  of  the  “toss  once”  process.  For  the  “toss 
once”  process  E  =  {toss},  and 
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L(e)  =  L( toss)  =  1  ,L(s)  =  0,  otherwise. 

The  “repeated  toss”  process  can  be  described  by  the  p-language  L*e. 

Since  A(L)(.s)  =  1  if  and  only  if  s  =  toss,  and  zero  otherwise,  it  follows  easily  from 
induction  that  A (L)^(s)  =  1  if  and  only  if  s  =  tossn.  Suppose  we  are  interested  in  finding 
the  probability  of  n  tosses.  Then  this  is  given  by: 


L*e  (toss"  )  = 


i>0 


(tossn)  - 


5>?:A  (L)(l) 


i>  1 


(tossn) 


=  [en-1A(L)(n-1)(tossn-1)L(toss)  +  e"  A(L)(n)  (toss"  )L(e)]  -  [e"  A(L)(n)  (toss")] 


=  e 


n—  1 


e"  -  e" 


=  e 


n—  1 


which  is  precisely  the  probability  of  getting  n  —  1  consecutive  tails. 

Similarly,  if  we  are  interested  in  determining  the  probability  of  termination  after  n  tosses, 
then  this  is  given  by: 


A(L*e  )(toss") 


-5>lA(L)«(toss") 

e  /  •  i 

-e"A(L)"(toss") 

e 


which  is  precisely  the  probability  of  getting  n  —  1  consecutive  tails  followed  by  a  head. 


5  Automata  for  Probabilistic  Languages 

In  this  section  we  define  “probabilistic  automata” — nondeterministic  automata  with 
probabilities  associated  with  its  transitions,  which  can  be  used  to  represent  p-languages  in 
a  concise  manner.  It  is  easier  represent  a  stochastic  discrete  event  system  as  a  probabilistic 
automaton. 

Definition  8  A  probabilistic  automaton  (p-automaton)  G  over  the  event  set  S  is  a  triple 
G  :=  (A,:  x0,  P),  where  A"  is  the  set  of  states,  x0  G  X  is  the  initial  state,  and  P  :  X  x  E  x  X  — >■ 
[0, 1]  is  the  transition  probability  function  satisfying 

Cl:  \/x  E  X  :  Zyex  P{x,  T  ?/)<!• 

G  is  said  to  be  deterministic  if  for  each  x  G  A"  and  a  G  E,  there  exists  at  most  one  y  G  A" 
such  that  P(x,cr,y)  >  0. 
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Such  a  p-automata  starts  from  its  initial  state  x0,  and  when  at  state  x  it  moves  to  state 
y  on  event  a  with  probability  P{x,cr,y).  For  each  x  G  X,  we  define  A (G)(x)  :=  1  — 
Y.ae s  Et/ex  P(x i  y)  t°  be  the  probability  of  termination  at  state  x. 

Next  we  define  the  p-language  “generated  by”  such  a  p-automaton  G.  The  transition 
probability  function  can  be  extended  to  paths  in  G.  where  a  path  in  G  is  defined  to  be  a 
member  of  X(ZLY)*.  In  other  words,  a  path  is  obtained  by  concatenating  transitions  where 
the  end  and  start  states  of  the  consecutive  transitions  are  the  same.  Given  a  path  7 r  = 
'XqG\X\  . . .  (Tnxn ,  we  use  |7r|  =  n  to  denote  its  length;  for  each  k  <  |7r|,  nk  :=  Xq<j\X\  . . .  <Jkxk 
to  be  the  initial  sub-path  of  length  k ;  and  tr( tt)  :=  <j\ . . .  an  to  be  the  event  trace  associated 
with  path  tt.  The  probability  measure  for  paths  is  inductively  defined  as: 

Vx  e  X  :  P(x)  =  1; 

Vtt  G  A(SA)*,a  G  G  X  :  Pinery)  :=  P{iT)P{x\v\,a,y) 

In  other  words,  the  probability  measure  of  a  path  is  the  product  of  probability  measures  of 
the  individual  transitions  constituting  the  path. 

With  these  preliminaries  we  next  define  the  p-language  generated  by  a  p-automaton. 

Definition  9  Given  a  p-automaton  G  :=  (X.  xo,  P),  the  p-language  generated  by  G.  denoted 
La.  is  defined  as: 

La(s)  :=  •£  p(n 

TT:tr(Tr)=s,iT0=Xo 

The  following  proposition  states  that  L  is  indeed  a  p-language. 

Theorem  8  Let  G  :=  (X,xo ,P)  be  a  p-automaton  over  S.  Then  Lq  is  a  p-language. 

Proof:  From  definition,  we  have  ^c(e)  =  P(x 0)  =  1,  which  demonstrates  PI.  In  order  to 
see  P2,  consider  a  trace  s  G  S*  of  length,  say,  n.  Then 


Y.  La(sv) 


=  E  E  pm 

a  ir:tr(TT)=sa,iT0—xo 

=  E  E  P(7r)pixn(7t),a,y) 

a  Tt:tr(Tt)=s,TT°=xo  U&n 

=  E  P^)  E  EP(-W(7r),cr,2/) 

Tt:tr(Tt)=S,TT°  =  Xo  yen  o 

{from  Cl} 

<  E 

TT:tr(n)—s,TT°—xo 

=  Lg(S). 
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Remark  6  It  follows  that  every  p-automaton  defines  a  p-language.  Conversely,  every  p- 
language  can  be  represented  by  a  p-automaton:  Given  a  p-language  L.  define  a  p-automaton 
G  :=  (E*,e,  P),  where 


Vs,  t  G  £*,  a  G  £  :  P(s,  cr,  t)  :  = 


7t4  if  t  =  so 
L(s ) 

0  otherwise 


Since  L(s)  >  L(t)  whenever  s  <  t  (Remark  3),  in  the  definition  of  transition  probability 
P(s,a,t)  <  1.  Then  it  is  easy  to  verify  that  LG  =  L  and  that  Cl  holds  for  G.  Also  note 
that  G  is  a  deterministic  p-automaton. 


A  special  class  of  p-languages  which  is  of  interest  is  the  class  that  can  be  represented  by 
a  p-automaton  with  finitely  many  states,  which  we  call  the  class  of  regular  p-languages. 

Definition  10  A  p-language  L  G  C  is  said  to  be  regular  if  there  exists  a  p-automaton  G 
with  finitely  many  states  such  that  La  =  L. 


The  next  theorem  shows  that  the  operators  choice,  concatenation,  and  concatenation- 
closure  preserve  regularity. 

Theorem  9  Consider  regular  p-languages  Li,L2  and  e  G  [0, 1].  Then  L\  +eL2,  Li.eL2,  and 
L*{  are  also  regular  p-languages. 


Proof:  For  i  =  1,2,  let  G,  :=  (X*,  xo,i,  Pi)  be  finite  p-automata  with  LGi  =  Lt. 

In  order  to  show  the  regularity  of  L\  +e  L2,  define  a  finite  p-automaton  G  :=  (X,xo ,P), 
where  the  state  set  is  X  :=  Xi  U  X2  U  {rr0} ,  Xq  is  a  new  state  which  is  the  initial  state  of  G, 
and  the  state  transition  probabilities  are  given  by: 


Va\  y  €  X,  a  €  E  :  P(x,  o ,  y)  :=  < 


Pi(x,a,y) 
P2{x,o,y) 
ePtixo^.o.y) 
eP2(^0,2,  <j,  y) 
0 


if  x,  y  e  Xi 
if  x,  y  £  X2 
if  x  =  Xo,  y  G  A" | 
if  a;  =  .r0,  y  e  X"2 
otherwise 


Then  it  is  easy  to  see  that  G  is  a  p-automaton,  and  LG  =  Lx  +e  L2. 

Next  to  show  the  regularity  of  L1.eL2,  define  a  finite  p-automaton  G  :=  (X,  a;0,i,  P), 
where  the  state  set  X  :=  Ah  U  X2,  and  the  state  transition  probabilities  are  given  by: 


V.T,  y  G  A",  cr  G  S  :  P(x,  a,  y) 


'  Pi(x,o,  y )  if  x,  y  £  Xi 

P2{x,  cr,  y)  if  x,yeX2 

e[A(Gi)(:r)]P2(.'i’o,2,  cr ,  y)  if  x  G  Xi,  y  G  X2 
Ootherwise 


Then  it  is  easy  to  see  that  G  is  a  p-automaton,  and  LG  =  Li.eL2. 

Finally  to  show  the  regularity  of  L\e  define  a  finite  p-automaton  G  :=  (AC,  x0,i,  P),  where 
the  state  transition  probabilities  are  given  by: 

Vs,  y  G  AC,  a  G  S  :  P(.r,  cr,  y)  :=  P^x,  cr,  y)  +  e[A(Gi)(.r)]Pi(:r0,i,  cr,  y). 

Then  it  is  easy  to  see  that  G  is  a  p-automaton,  and  LG  =  L\e .  ■ 
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6  Performance  Functions 


In  order  to  study  the  performance  of  a  stochastic  discrete  event  system,  we  use  real 
valued  functions  defined  over  the  set  of  event  traces  to  associate  a  cost  with  each  trace  of 
the  system,  and  we  compute  the  average  value  of  the  cost  with  respect  to  the  completion 
probability  function.  Recall  that  the  completion  probability  function  associates  a  proba¬ 
bility  of  termination  with  each  trace  of  the  system.  Since  the  outcomes  corresponding  to 
terminations  following  distinct  traces  are  all  mutually  exclusive,  the  cumulative  probability 
of  termination  is  obtained  by  simply  adding  the  completion  probabilities  of  the  individual 
traces.  This  cumulative  probability  equals  one  if  and  only  if  the  system  is  terminating  (Theo¬ 
rem  1),  in  which  case  the  completion  probability  function  defines  a  probability  mass  function 
[5]  on  the  set  of  traces.  So  we  assume  in  this  section  that  the  systems  under  investigation 
are  terminating,  and  use  it  to  give  the  definition  of  the  average  performance  of  a  system. 

Definition  11  Given  a  terminating  system  with  p-language  L  £  £,  i.e.,  £s  A(L)(s)  =  1, 
and  a  performance  function  F  :  E*  — >■  71  representing  cost  associated  with  each  trace, 
the  average  value  of  F  with  respect  to  the  completion  probability  mass  function,  denoted 
E[F,  A(L)]  £  H,  is  given  by  E[F,  A(L)]  :=  £s  A(L)(s)F(s). 

In  the  following  subsections  we  consider  two  special  types  of  performance  functions  and 
study  how  their  values  for  complex  systems  can  be  efficiently  obtained  from  those  of  com¬ 
ponent  subsystems. 

6.1  Additive  Performance  Function 

Let  F  :  E*  — >■  1Z  be  an  “additive  performance  function”  satisfying  the  property: 

FI:  F{st)  =  F(s)  +  F(t) 

An  example  of  an  additive  performance  function  is  the  completion  time  (or  the  first  moment 
of  completion  time  if  it  is  a  random  variable  [14])  which  gives  the  time  consumed  in  execution 
of  a  trace. 

The  following  lemma  lists  a  few  properties  of  the  average  of  an  additive  performance 
function. 

Lemma  2  Consider  an  additive  performance  function  F  :  E*  — >■  71,  and  terminating  p- 
languages  Li,L2,L  £  C  having  completion  probability  functions  Ci,C2lC,  respectively. 
Then 

1.  E[F,  A(Lj)  o  A(L2)]  =  E[F,  A(Lj)]  +  E[F,  A (L2)] 

2.  E[F,A{L)W]  =  nE[F,  A(L)] 


20 


Proof:  We  only  prove  the  first  part  since  the  second  follows  from  the  first.  We  have  the 
following  series  of  equality: 


£[F,A(L!)oA(L2)]  =  ^A(L1)oA(L2)(.S)F(S) 

5 

{.s  =  £.£_1s  when  t  <  $} 

=  EE  A(L1)(f)A(L2)(r1s)]F(f.r1s) 

s  t 

{changing  order  of  summation  and  using  additivity  of  F} 

=  EEA(Li)«)A(L2)(r'S)[F(i)  +  F(rt)] 

t  s 

{applying  change  of  variables  u  :=  t  1  { 

=  ££A<mwA(£2)(«)[F(i) +  *■(«)] 

t  U 

=  ££A(L1)(t)A(L2)(11)f(f)  +  ££A(L1)(f)A(L2)(«)f(») 

t  u  t  u 

=  EAfLJWEA^KWfM+EAlLOWEAfLJWFM 

u  i  t  u 

{£A(L2)(«)  =  EA(L1)(«)  =  l} 

"U  £ 

=  E[F,A(L!)]  +  E[F,A(L2)]. 


The  result  of  the  above  lemma  is  used  in  the  next  theorem  to  obtain  average  performance 
of  a  complex  system  in  terms  of  those  of  the  constituent  subsystems. 

Theorem  10  Consider  an  additive  performance  function  F  :  S*  — >■  71,  and  terminating 
p-languages  Z^,  L2,  L  G  C. 

1.  E[F,A(L1+eL2)]  =  eE[F,A(L1)]+eE[F,A(L2)] 

2.  E[F,  A(L1.eL2)]  =  E[F,  A(Li)]  +  eE[F,  A(L2)] 

3.  E[F,  A(L*e)]  =  \E[F,  A(L)],  whenever  e  <  1 

Proof:  1.  By  Theorem  3,  A  (Li  +e  L2)  =  e.A(Li)  +  eA(L2).  So 

E[F,A(L1+eL2)]  =  ^2[eA(Li)(s)F(s)  +  eA(L2)(s)F(s)] 

=  «EA(ii)(s)f(s)  +  e£A(L2)(s)F(S) 

5  S 

=  eE^AiL^  +  eE^AiL,)]. 

2.  By  Theorem  5,  A (L\.eL2)  =  eA(Li)  +  eA(^i)  °  A (L2).  So  from  the  first  part  above, 
E[F,  A(Li.eL2)]  =  eE[F ,  A(Li)]  +eE[F,  A(Li)  oA(L2)].  Since  F  is  additive,  from  Lemma  2, 
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E[F,  A  (Li)  o  A  (L2)]  =  i?[.F,  A(Li)]  +  If[.F,  A(L2)],  from  which  the  result  follows. 

3.  By  Theorem  7,  A {L*e)  =  |  £j>i  elA(L)^\  So  from  the  first  part  of  Lemma  2, 

E[F,  A(L*e)]  =  -J2^E[F,A(L)^] 

e  i>  i 

{from  Lemma  2,  part  2} 

6  i> l 

e  i>  1 

{y:  e'  i  =  —  whenever  e  <  1} 
i>  1  e 

=  i£[f,A(L)]. 

e 

■ 

One  can  use  the  results  derived  above  to  study  the  average  completion  time  of  timed 
discrete  event  systems,  where  transition  probabilities  as  well  as  transition  occurrence  times 
are  known. 

6.2  Multiplicative  Performance  Function 

Let  F  :  E*  — >■  1Z  be  a  multiplicative  performance  function  satisfying  the  property: 

F2:  F{st)  =  F(s)F(t) 

An  example  of  a  multiplicative  performance  function  is  the  reliability  function  (or  the  first 
moment  of  the  reliability  function  if  it  is  a  random  variable)  which  gives  the  probability  that 
a  system  does  not  fail,  i.e.,  operates  reliably,  after  executing  a  trace. 

The  following  lemma  lists  a  few  properties  of  the  average  of  a  multiplicative  performance 
function. 

Lemma  3  Consider  a  multiplicative  performance  function  F  :  S*  — >■  1Z,  and  terminating 
p-languages  Li,  I/2,  L  e  C. 

1.  E[F,A(L1)oA(L2)]  =  E[F,A(L1)]E[F,A(L2)] 

2.  E[F,  A(L)W]  =  {E[F,  A(L)])n 

Proof:  Since  the  second  part  follows  from  the  first  part,  we  only  prove  the  first  part.  We 
have  the  following  series  of  equality: 

E[F,  Ai^oA^)]  =  ^2  A(l1)  o  A(l1)(s)F(s) 
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{.s  =  t.t  1  .s  when  t  <  s\ 

=  £EA(L1)(t)A(L2)(r1S)]F(«-M 

S  t 

{changing  order  of  summation  and  using  multiplicativity  of  F} 

t  S 

{applying  change  of  variables  u  :=  t~x. 5} 

=  EEA(£,)(«)A(t2)(«)[f(t)f(«)] 

t  U 

=  [EA(L1)(i)F(*)][EA(L2)(«)FM] 

t  U 

=  E[F,  A(Li)]E[F,  A(L2)]. 


The  result  of  the  above  lemma  is  used  in  the  next  theorem  to  obtain  average  performance 
of  a  complex  system  in  terms  of  those  of  the  constituent  subsystems. 


Theorem  11  Consider  a  multiplicative  performance  function  F  :  S*  — >■  77,  and  terminating 
p-languages  Li,L2,L  G  C.  Then 

1.  E[F,  A(Li  +e  L2)\  =  eE[F ,  A(Iq)]  +  eE[F,  A (L2)] 

2.  E[F,A(L1.eL2)]=eE[F,A(L1)]+eE[F,A(L1)]E[F,A(L2)] 

3.  E[F,A(L*')}  =  tS whenever  e£[F,  A(L)]  <  1. 

Proof:  1.  The  proof  of  this  part  is  the  same  as  that  for  the  first  part  of  Theorem  10. 


2.  By  Theorem  5,  A (L1.eL2)  =  eA^)  +  eA(Li)  o  A (L2).  So  from  the  first  part  above, 
E[F,  A(Li.eL2)]  =  eE[F,  A(Li)]  +eE[F,A(Li)  o  A(L2)].  Since  F  is  multiplicative,  from 
Lemma  3,  E[F,A(Li)  o  A(L2)]  =  E[F,  A(Li)]E[F,  A(L2)\,  from  which  the  result  follows. 

3.  By  Theorem  7,  A (L*e)  =  {  ^f>1  e*A {L)^\  So  from  the  first  part  of  Lemma  3, 


E[F,  A(L*e)] 


-Ee’-BpAltp] 

e  i>  1 

{from  Lemma  3,  part  2} 

e-j:e'(E{F,A(L)}f 

6  i>  1 


{Ee'(£[CA(L)])' 

i>  1 


eE\FA(L)\ 

1  -  eE[F,  A(L)\ 


whenever  eE[F,A(L)\  <  1} 


e 

e 


1 


eE[F,A(L)] 

1  -  eE[F,  A(L)] 
eE[FA(L)} 

—  eE[F,  A(L)]  ’ 
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Remark  7  The  results  of  Theorems  10  and  11  can  be  combined  to  obtain  compositional 
results  for  more  general  performance  functions  that  are  “bilinear”.  Suppose  for  example 
F(st)  =  F(s )  +  F(t)  +  2 F(s)F(t),  which  is  satisfied  by  the  second  moment  of  a  random 
completion  time  [14].  Then  using  the  techniques  of  Theorems  10  and  11  it  can  be  shown 
that 

1.  E[F,  A(Li)  o  A (L2)]  =  E[F,  A(LX)]  +  E[F,  A (L2)]  +  2 E[F,  A (Li)]£[F,  A (L2)] 

2.  £[F,  A(Li  +e  L2)]  =  e£[F,  A (L,)]  +  eE[F,  A (L2)] 

3.  £[F,  A(L1.eL2)]  =  £[F,  A(L0]  +  e£[F,  A(L2)]{1  +  2£[F,  A(Ti)]} 

7  Conclusion  and  Discussion 

In  this  paper  we  have  introduced  a  formalism  for  the  specification  and  analysis  of  qual¬ 
itative  behavior  of  stochastic  discrete  event  systems  which  are  typically  modeled  as  nonde- 
terministic  automata  with  probabilities  associated  with  transitions.  We  have  shown  that 
systems  defined  in  our  formalism  form  a  complete  partial  order  under  a  suitable  ordering 
relation.  The  regular  operators  choice,  concatenation,  and  concatenation  closure  have  been 
defined  in  this  formalism  which  allow  representation  of  complex  systems  as  combination  of 
simpler  ones.  These  operators  are  continuous  in  the  complete  partial  order  of  systems.  Con¬ 
sequently  it  is  easy  to  define  systems  using  recursions  involving  these  operators.  It  is  also 
shown  that  the  space  of  these  systems  is  a  complete  metric  space  in  which  the  operators 
are  continuous  and  contracting.  So  the  uniqueness  of  the  fixed  points  of  recursive  equations 
follows  from  the  contraction  mapping  theorem.  Various  average  performance  measures  of 
stochastic  discrete  event  systems  can  be  defined  in  this  formalism,  and  we  provide  analytical 
techniques  to  aide  their  computations. 

Our  formalism  is  also  suitable  for  studying  supervisory  control  problems  of  stochastic 
discrete  event  systems.  We  discuss  a  few  ways  in  which  control  problems  can  be  formulated 
in  this  setting.  A  detailed  examination  of  these  problems  is  subject  of  work  in  progress. 
Following  the  framework  of  supervisory  control  proposed  by  Ramadge  and  Wonham  [18], 
the  event  set  is  partitioned  into  the  set  of  controllable  and  uncontrollable  events.  The 
supervisor  is  not  allowed  to  disable  the  uncontrollable  events,  but  can  disable  any  of  the 
controllable  events.  One  possibility  is  that  the  probability  of  the  disabled  events  is  added  to 
the  termination  event  probability.  In  other  words,  the  supervisor  terminates  the  system  if 
any  of  the  disabled  event  is  chosen.  Another  possibility  is  that  the  probabilities  of  the  enabled 
events  (including  that  of  the  termination  event)  is  renormalized  using  the  probabilities  of 
the  disabled  events.  In  other  words,  the  probability  of  an  enabled  event  is  the  conditional 
probability  of  its  occurrence  subject  to  the  condition  that  certain  events  are  disabled. 

One  possible  control  objective  may  be  specified  as  a  pair  of  p-languages  K\  A  /i2.  The 
“upper”  p-language  K->  may  be  deterministic  specifying  a  legality  specification  (anything  in 
its  complement  is  considered  illegal),  whereas  the  “lower”  p-language  K\  imposes  restriction 
on  the  occurrence  probability  of  the  legal  traces,  which  must  occur  with  certain  minimum 
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probabilities.  The  goal  is  to  selectively  disable  controllable  events  so  that  the  p-language 
of  the  supervised  system  lies  within  the  range  of  the  prescribed  p-languages.  It  is  easy  to 
see  that  this  framework  generalizes  that  of  Ramadge-Wonham  and  reduces  to  the  Ramadge- 
Wonham  framework  when  all  p-languages  are  deterministic  languages. 

A  Metric  for  Probabilistic  Languages 

In  this  appendix  we  show  that  a  suitable  metric  can  be  defined  on  the  space  of  p-languages 
under  which  it  is  complete,  and  the  choice  and  concatenation  functions  are  both  continuous 
and  contracting. 

Definition  12  A  metric  d  :  C  x  C  — >■  1Z+  on  the  set  of  p-languages  is  defined  as  follows: 

VK,Le  £  :  d(K,  L)  :=  sup  | I<{s)  -  L{s)  |. 

It  is  well  known  that  d  as  defined  above  is  a  metric,  i.e.,  it  is  non-negative,  symmetric, 
satisfies  triangle  inequality,  and  its  value  for  a  p-language  pair  is  zero  if  and  only  if  the  two 
languages  are  the  same.  The  following  result  states  that  £  is  complete  under  this  metric. 

Theorem  12  (£,  d)  is  a  complete  metric  space. 

To  show  the  completeness  of  (£,  d),  we  show  that  every  Cauchy  sequence  converges. 
Consider  a  Cauchy  sequence  of  p-languages  {£*}*> q.  Since  C  is  a  subspace  of  real  valued 
functions,  which  is  a  complete  metric  space,  the  limit  of  the  Cauchy  sequence,  say  Too,  exists 
in  the  space  of  real  valued  functions.  We  need  to  show  that  L ^  G  C. 

PI  follows  from  the  fact  that: 

Too(e)  =  lim  Tj(e)  =  lim  1  =  1. 

i—¥  OO  i—¥  oo 


Next  to  see  P2  we  have: 


^  '  Lqc{S(7 )  — 


< 


V  lim  Li(sa) 

i — s*-oo 

( 7 


lim  Li(sa) 


{applying  P2  to  L, } 


lim  I/j(s) 

i—¥  oo 


Loo  (^)  • 


In  the  the  following  theorem  we  show  that  the  choice  and  concatenation  functions  are 
both  contracting. 
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Theorem  13  Consider  a  p-language  L  G  £  and  e  G  [0,1].  Then  the  functions  +eL,  ,eL  : 
£  — y  jC,  are  both  contracting  (and  hence  continuous)  whenever  e  ^  0  and  e  ^  1,  respectively. 

Proof:  Consider  K\ ,  K2  G  £.  Then  we  have 

d(+eL(iA),+eL(A2))  =  sup  \(eKi(s)  +  eL(.s))  —  (eK2(.s)  +  eL(s))\ 

S 

=  esup  \I<i{s)  -  K2{s)\ 

S 

=  ed\Ki,K2\, 

which  shows  that  +eL  is  a  contraction  whenever  e  ^  0  (so  that  e  ^  1).  Similarly, 
d(.eL(K1),.eL(K2)) 

=  sup  |(L(s)  -  eA(L)(s)  +  eA (L)  o  A'i(s))  -  {L{s)  -  e.A (L)(s)  +  eA (L)  o  K2{s))\ 

S 

{from  linearity  of  convolution} 

=  esup  |A(L)  o  (Ki(s)  -  I\2{s)) \ 

S 

=  esup  A(L)(t)[K1(t~1s)  -  K2(t~ls )] 

s  t 

{since  A’i(t_1s)  —  A'2(t_1s)  <  sup  |A'i(t_1s)  —  A'2(t_1s)|} 

t 

<  esup  sup  |A'i(t_1s)  —  K2(t~1s)\  y~]  A (L)(t) 

sit  1  t 

{since  Y,  A (L)(t)  <  1,  Theorem  1,  part  1} 

t 

<  esup  sup  {Kiit-'s)  —  K2(t~1s)\ 

S  t 

{applying  change  of  variables  u  :=  t~1s} 

=  e  sup  | K i  (u)  —  K>  (u)  | 

U 

=  ed(Ki,  K2), 

which  shows  that  .eL  is  contraction  whenever  e  ^  1.  ■ 

Remark  8  Since  choice  and  concatenation  functions  are  both  contractions  defined  over  the 
complete  metric  space  (£,  d).  it  follows  from  the  contraction  mapping  theorem  [19]  that  they 
possess  unique  fixed  points.  (The  uniqueness  of  fixed  points  is  not  guaranteed  by  lattice 
theoretic  approach  of  Theorems  4  and  6.)  The  same  holds  for  any  function  /  composed  of 
the  choice  and  the  concatenation  function.  The  unique  fixed  point  of  /  can  be  obtained  by 
taking  the  limit  of  the  sequence  of  p-languages  {£,}  defined  recursively  by  Li+ 1  =  /(Lj), 
and  L0  €  £  is  arbitrary. 
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